Using Sequence Motifs for Enhanced Neural Network Prediction of Protein Distance Constraints

Authors

  • Jan Gorodkin
  • Ole Lund
  • Claus A. F. Andersen
  • Søren Brunak
Abstract

Correlations between sequence separation (in residues) and distance (in ångströms) of any pair of amino acids in polypeptide chains are investigated. For each sequence separation we define a distance threshold. For pairs of amino acids where the distance between Cα atoms is smaller than the threshold, a characteristic sequence (logo) motif is found. The motifs change as the sequence separation increases: for small separations they consist of one peak located between the two residues, then additional peaks appear at these residues, and finally the center peak smears out for very large separations. We also find correlations between the residues in the center of the motif. These and other statistical analyses are used to design neural networks with enhanced performance compared to earlier work. Importantly, the statistical analysis explains why neural networks perform better than simple statistical data-driven approaches such as pair probability density functions. The statistical results also explain characteristics of the network performance for increasing sequence separation. The improvement of the new network design is significant in the sequence separation range of 10-30 residues. Finally, we find that the performance curve for increasing sequence separation is directly correlated with the corresponding information content. A WWW server, distanceP, is available at http://www.cbs.dtu.dk/services/distanceP/.
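The labelling step described in the abstract (comparing the Cα-Cα distance of residue pairs at a fixed sequence separation against a threshold) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `label_pairs`, the toy coordinates, and the threshold value of 5.0 Å are all assumptions chosen for the example, and the abstract does not state how the per-separation threshold is derived.

```python
import math

def label_pairs(ca_coords, separation, threshold):
    """Label residue pairs (i, i + separation) by Calpha-Calpha distance.

    ca_coords  -- list of (x, y, z) Calpha coordinates in angstroms, one per residue
    separation -- sequence separation in residues
    threshold  -- distance cutoff in angstroms for this separation
                  (hypothetical value; supplied by the caller)
    Returns a list of (i, j, distance, is_below_threshold) tuples.
    """
    pairs = []
    for i in range(len(ca_coords) - separation):
        j = i + separation
        d = math.dist(ca_coords[i], ca_coords[j])  # Euclidean distance
        pairs.append((i, j, d, d < threshold))
    return pairs

# Toy five-residue chain; coordinates are invented for illustration only.
coords = [
    (0.0, 0.0, 0.0),
    (3.8, 0.0, 0.0),
    (3.8, 3.8, 0.0),
    (0.0, 3.8, 0.0),
    (0.0, 7.6, 0.0),
]

result = label_pairs(coords, separation=3, threshold=5.0)
for i, j, d, close in result:
    print(f"residues {i}-{j}: {d:.2f} A, below threshold: {close}")
```

Pairs flagged as below the threshold would be the ones whose surrounding sequence windows are collected to build the motif for that separation; repeating the call with a different `separation` and its own `threshold` gives one labelled set per separation.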

Related articles

Protein Structure Assembly from Knowledge of β-Sheet Motifs and Secondary Structure

We develop and test a new hierarchical approach for the prediction of protein structure. An algorithm is described to assemble the 3D fold of a protein starting from its secondary structure and β-sheet topology. Reconstruction is carried out by energy minimization of a reduced protein model, where β-partners are derived from appropriate distance constraints imposed by the knowledge of βsheet mo...


Prediction of protein supersecondary structures based on the artificial neural network method.

The sequence patterns of 11 types of frequently occurring connecting peptides, which lead to a classification of supersecondary motifs, were studied. A database of protein supersecondary motifs was set up. An artificial neural network method, i.e. the back propagation neural network, was applied to the predictions of the supersecondary motifs from protein sequences. The prediction correctness r...


Prediction of the Weight and Number of Eggs in Mazandaran Native Fowl Using Artificial Neural Network

Traditional poultry production has grown into a considerable industry within a few decades. The poultry industry is now one of the main sources of the protein required for human consumption. Predicting the weight and number of eggs from economic traits can improve the efficiency of production and the profit of producers. In the present study, the weight and number of eggs in Mazandaran ...


Application of Linear Regression and Artificial Neural Network for Broiler Chicken Growth Performance Prediction

This study was conducted to investigate the prediction of growth performance in broiler chickens using linear regression and an artificial neural network (ANN). ANNs are powerful tools for modeling systems in a wide range of applications. The ANN model with a back propagation algorithm successfully learned the relationship between the inputs of metabolizable energy (kca...


Journal:
  • Proceedings. International Conference on Intelligent Systems for Molecular Biology

Publication date: 1999